Characterising vowel phonation by fundamental spectral normalisation of LX-waveforms

نویسندگان

  • Christopher J. Moore
  • Nicholas Slevin
  • S. Winstanley
چکیده

Objective spectral reference standards for normal vowel phonation are described. Application in larynx cancer monitoring is envisaged. 120 individuals contributed to a database of vowels lad and /i/ in the form of trans-larynx impedance time series captured using an electrolaryngograph. The impedance signals are used for iterated power spectral estimation followed by spectral intra-pooling for each individual. Pooling of spectra across many individuals is complicated by significant variations in the precise frequencies and powers of the fundamental, harmonics and any other characteristic peaks that may be present. Fundamental-harmonic normalisation (FHN) of individual spectra circumvents these obstacles by normalising all powers relative to that of the fundamental and by transforming the entire frequency range into floating point multiples of the fundamental-frequency fO. Population pooling then results in stable FHN-spectral patterns and associated characteristic distributions of fo values. For females it is FHN-spectral pattern that matters most. For males the fO distribution is also highly characteristic. Introduction The choice of medical treatment, driven by economic considerations and issues of accepted practice, is increasingly based on objectively assessed outcome. A particular example driving this study is the competition between radiotherapy and surgery as treatment of choice for larynx cancer. Since both techniques are equally successful for local control of the disease, quality of life after treatment has become an influencing factor. In this context radiotherapy has obvious advantages for the maintenance of vocal fold function compared to the surgical removal of tissues in laryngectomy. However, conservative and reconstructive surgical techniques now claim to offer similar advantages. The result is a heightened interest in the relative degree of conservation offered by the two therapeutic modalities. To the patient the most direct evidence of normal vocal fold functionality is 'voice quality'. To the expert such evidence is to be found in measures of glottal waveform that are free from the complicating factors introduced by tract resonance. In tum this has spotlighted the lack of concise but characteristically detailed objective reference standards for normal glottal waveforms, or for that matter voice quality, against which a patient can be compared. Pilot studies for assessing patient vocal fold function before and at intervals after radiotherapy have already shown promising results, even though they are based on relatively unrefmed standards (1,2]. This report describes what is believed to be significant progress in the generation of improved standards that will underpin further clinical investigations Theoretical Background Speech and language therapists, SALTs, subjectively assess and score voice quality using a range of parameters (3]. Many parameters are simply descriptive, e.g. whisper, although some have been adopted from the physical sciences to produce a hybrid terminology, e.g. fundamental frequency and shimmer (rather than variance). All are capable of interpretation in terms of spectral content in the frequency domain [4]. The periodic glottal waveform produced by the vibrating vocal folds of the larynx is the driving force behind the production of the complex human acoustic waveform. As such the glottal waveform is an indication of the functional integrity of the vocal folds which, as already explained, is of particular interest in the management of larynx cancer. In this and the wider medical context SALTs increasingly supplement their assessments with objective measurements that are closely correlated with glottal waveform, in particular those from electrolaryngography [5]. MAVEBA 1999, Firenze, Italy 1 Models and Analysis of Vocal Emissions for Biomedical Applications

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stimulated production of vowel-like LX-waveforms and spectral damping in the absence of phonation

Electrical impedance 'LX' waveforms measured across the neck at the thyroid level during phonation are known to be correlated with vocal fold movement. Changes in vocal fold contact are thought to be the cause of this phenomenon though emerging applications in radiotherapy indicate that changes in the configuration of both fold and neck tissues are correlated with LX waveform shape. In this pap...

متن کامل

Paralinguistic variation and invariance in the characteristic frequencies of vowels.

It is shown that within-speaker variations in vocal effort and phonation affect fundamental frequency (F0) and the formant frequencies of vowels in the sense of a linear compression/expansion of the spectral separations between them, given an adequate scaling of pitch. Between-speaker variations in size correspond to a translation of the spectral peaks shaped by F0 and the formants if pitch is ...

متن کامل

Temporal control and compensation for perturbed voicing feedback.

Previous research employing a real-time auditory perturbation paradigm has shown that talkers monitor their own speech attributes such as fundamental frequency, vowel intensity, vowel formants, and fricative noise as part of speech motor control. In the case of vowel formants or fricative noise, what was manipulated is spectral information about the filter function of the vocal tract. However, ...

متن کامل

Spectrum factors relevant to phonetogram measurement.

Phonetograms showing the sound-pressure level (SPL) in loudest and softest possible phonation are frequently used in some voice clinics as an aid for describing the status of voice function. Spectrum analysis of the vowel /a/ produced by ten females and ten males with healthy, untrained voices revealed that the fundamental was mostly the strongest spectrum partial in soft phonation while the lo...

متن کامل

Analysis of voice production in breathy, normal and pressed phonation by comparing inverse filtering and videokymography

The present study addresses comparison of two analysis methods of voice production, inverse filtering and videokymography (VKG). Speech data were collected from two male speakers using sustained phonation during laryngoscopy (sound corresponding approximately to the vowel /ä/). The type of phonation was varied between breathy, normal, and pressed. From the waveforms given by inverse filtering, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999